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Gene iucD of the aerobactin operon of the Escherichia coli plasmid ColV-K30 encodes a membrane-bound 
enzyme synthesizing A^-hydroxylysine, the first product of the aerobactin biosynthesis pathway. The entire 
nucleotide sequence of the cloned iucD gene was determined, from which the primary and some aspects of the 
secondary structure of the encoded peptide were deduced. E. coli cells harboring multicopy plasmid pVLN12 
(iucD + ) hyperproduced an approximately 50-kilodalton peptide which was purified and identified as the 
product of the gene by examination of its amino- terminal sequence. Two iucD'-'lacZ gerie fusions were 
constructed in vitro and four iucD'-'phoA gene fusions were generated in vivo by mutagenesis of iucD with 
transposon TnphoA (Tn5 ISSO^nphoA). Analysis of the corresponding fusion proteins suggested at least two 
domains of attachment of the IucD protein to the inner side of the cytoplasmic membrane. The first apparent 
membrane-bound domain was found within the first 25 amino acids of the protein and showed a sequence which 
resembled that of the signal peptides; 



Under conditions of iron stress, most bacteria induce 
high-affinity iron transport systems mediated by the chela- 
tors named siderophores (9, 42). The most thoroughly stud- 
ied system is that encoded by the Escherichia coli virulence 
plasmid ColV-K30, which uses the siderophore aerobactin 
as an iron scavenger. This system is arranged in a single 
operon (11, 16, 24) driven by an iron-dependent promoter 
(6), which makes it ah ideal model for study of the organi- 
zation and transcriptional regulation of metal-sensing genes. 
Furthermore, the aerobactin system seems to be an impor- 
tant virulence determinant in nosocomial infections (for a 
review, see reference 14) and hence constitutes a potential 
target for new antimicrobial drugs. 

The entire aerobactin system spans about 8 kilobases (kb) 
of pColV-K30, including a cluster of five genes transcribed 
coordinated through a strong promoter (6) located at the 
front of the first gene (Fig. 1). Transcription of the operon is 
controlled, as in other iron-regulated systems in E. coli, by 
the product of the fur gene (2, 26), a repressor protein which 
uses Fe(II) as corepressor to bind to an operator region in 
the aerobactin promoter (4, 17). Four of the five genes (iucA 
to iucD) are involved in the biosynthesis of the siderophore, 
while the remaining gene (iutA) determines an outer mem- 
brane receptor for the ferri-aerobactin complex. The func- 
tion of each gene in subsequent steps of siderophore synthe- 
sis is known (16, 24), and their peptide products have been 
identified. In two cases {iucB and iutA), the proteins have 
been purified and their activity has been assessed in vitro 
(13, 34). 

The product of the gene iucD is of special interest for two 
reasons. First, it catalyzes a unique type of enzymatic 
reaction which does not occur in animal tissues, namely, the 
A^-oxygenation of lysine. Second, iucD mediates the first 
step of aerobactin biosynthesis, and hence effective inhibi- 
tors of its activity may have therapeutic value. 

A previous study (32) had shown the product of gene iucD 
to be bound to the cytoplasmic membrane, although the 
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function of the protein was uncertain at that time. In the 
present work we have determined the complete nucleotide 
sequence of the iucD gene and purified the corresponding 
peptide product. Structural features of the protein and 
details of its association to the inner membrane of E. coli 
were suggested by the analysis of iucD'-'phoA and iucD'- 
'lacZ gene fusions. 

MATERIALS AND METHODS 

Culture media, strains, and plasmids. Unless otherwise 
indicated, ail cultures were grown at 37°C in LB medium (41) 
to which adequate levels of antibiotics (either ampicillin or 
ampicillin plus kariamycin) were added to ensure retention of 
the plasmids. Strains used are listed in Tabie 1. E. coli 71-18 
was used as the host of all recombinant Mi3 clones for 
sequencing. E. coli LE392 was used for lytic growth of X 
TnphoA (Tn5 IS50^::phoA), and E. coli CC118 was tised as 
the recipient for TnphoA mutagenesis of iucD (see below). 
Plasmid pVLN12 (Fig. 1) contains an approximately 3.1-kb 
BamHl-EcoRl DNA fragment from the aerobactin operon 

(16) cloned at the corresponding - sites -of the pUC9 vector 
(52). This plasmid contains the complete sequence of iucD 
plus partial sequences of the flanking genes in the operon 

(17) , /wcC, and iutA (Fig. 1). The iucD gene in pVLN12 is 
expressed through the strong lac promoter of pUC9 (54). 
Plasmids resulting from the construction of phoA and lacZ 
gene fusions (see Table 2) will be described in the Results 
section. 

DNA manipulation, enzymatic assays, and chemical deter- 
minations. All DNA techniques, including plasmid construc- 
tion, ligation, preparation of competent E. coli cells, trans- 
formation, and fragment purification; were carried out by 
standard procedures (37) or following the directions of 
commercial suppliers. Restriction endonucleases were pur- 
chased from New England BioLabs, Inc., and "slow" Bal 31 
nuclease (1 U/uJ) was from International Biotechnologies, 
Inc. p-Galactosidase activity was measured by the method 
of Miller (41), and alkaiine phosphatase activity was deter- 
mined as previously described (40). The IucD + phenotype 
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FIG. 1. Strategy for sequencing the iucD gene. Top: Genetic 
organization of the aerobactin operon lined up with a partial 
restriction map of the corresponding DNA segment of pColV-K30. 
The native iron-regulated promoter of the system (6) is located in 
front of iucA and is indicated by an arrow. Bottom: Enlarged 
representation of the portion of the aerobactin cluster cloned in 
pVLN12 which contains the iucD gene. The orientation of the lac 
promoter from vector pUC9 (through which the iucD gene is 
transcribed in pVLN12) is indicated, as are the restriction sites used 
:o generate M13 clones for sequencing. Thin arrows indicate the 
location and orientation of the inserts in those M13 clones. Abbre- 
viations: A, AccI; B, BamH\\ Bg f '/Jg/II; E, EcoRl; Hd, /KndHI; H, 
HpalU K. Kpn\\ N, Ncol\ Na, Narl\ P, Pvu\l\ S, SaulAl; Sa, Sail; 
Sp, SpM;T, Taq\\ X, Xmnl. 

was tested by assaying for hydroxylamine in the culture 
supernatants of the corresponding strains with the Csaky 
test, as" modified by Gillam et al. (21). 

Sequencing strategy and sequence analysis. The approxi- 
mately 2.3-kb DNA fragment generated by treating pVLN12 
with the restriction endonuclease AccI, which overlaps most 
of the insert in pVLN12 (Fig. 1), was purified and digested 
separately with the enzymes SaulAl and Hpall. The result- 
ing fragments were repurified from polyacrylamide gels (37) 
and cloned at compatible sites of the replicative forms of 
M13mpl8 and M13mpl9 phages. The inserts of single- 
stranded forms of the resulting constructions were se- 
quenced by the dideoxynucleotide method (46). To confirm 
and complete the sequence of both DNA strands of the 
2.3-kb AccI segment, Xmnl, Sphl, Kpnl, TaqI, BgM, and 
Narl restriction endonucleases were used to generate addi- 
tional fragments which were isolated, cloned, and sequenced 
by the same procedure (Fig. 1). The nucleotide sequence 
obtained was analyzed with a Sequence Analysis progran} 
from the Department of Biochemistry and Biophysics of the 
University of California at San Francisco and with a Mac 
Gene program from Applied Genetic Technology (Fairview 
^ark, Ohio). 

Protein techniques. Proteins hyperexpressed by E. coli 
strains carrying recombinant plasmids were isolated directly 
from whole-cell lysates of the corresponding strains as 
follows. After overnight growth in LB medium (41) with 
appropriate antibiotics, cells were spun down, suspended in 
sodium dodecyl sulfate-polyacrylamide gel electrophoresis 
(SDS-PAGE) sample buffer (36), boiled for 3 min, and loaded 
in 8 to 10% polyacrylamide gels (36). The position of the 
protein bands of interest with respect to the migration of 
prestained molecular weight standards (Sigma Chemical 
Co.) was known in advance, and the corresponding slice of 
the unfixed gel was cut out and electroeluted in 80 mM TBE 



buffer pH 8.3 (37) with an electroseparation device (Elutrap; 
Schleicher & Schuell) (30). The peptidic material recovered 
was precipitated in 10% trichloroacetic acid, washed in 
ethanol, and lyophilized. Analysis of the amino-terminal 
sequence of electroeluted protein bands was carried out in 
the Department of Biochemistry and Biophysics of the 
University of California at San Francisco. 

Periplasmic proteins were released from E. coli cells by 
the chloroform shock procedure (18), dissolved in SDS- 
PAGE sample buffer (36), and analyzed in polyacrylamide 
gels as described above. 

Spheroplasts from cells carrying gene fusion plasmids 
were prepared and analyzed as follows. Overnight cultures 
of the E. coli strains harboring the different plasmids were 
spun down and suspended in the same volume of a solution 
containing 2 mM EDTA, 0.58 M sucrose, and 30 mM Tris 
hydrochloride (pH 8). Lysozyme was then added to 100 
fjLg/ml. Conversion of cells into spheroplasts was monitored 
with a light microscope. Spheroplasts were spun down, 
suspended in an equal volume of the same buffer without 
lysozyme, and split into two parts, one of them being treated 
with trypsin (0.6 u,g/ml) (Sigma, T8642) for 30 min at 22°C. 
Treated and untreated spheroplasts were then spun down, 
suspended directly in SDS-PAGE sample buffer, boiled for 3 
min, and analyzed in an SDS-PAGE system (36) as described 
above. 

Construction of gene fusions. Alkaline phosphatase gene 
fusions to iucD were obtained in vivo by infecting E. coli 
CC118 harboring pVLN12 (iucD + ) with X TnphoA as de- 
scribed (25). After infection, cells were plated on LB agar 
with kanamycin (300 fig/ml) and 40 u,g of the chrpmogenic 
substrate 5-bromo-4-chloro-3-indolyl phosphate (XP; Sigma) 
per ml. High-kanamycin-concentration-resistant colonies 
were examined for the PhoA + phenptype (blue in plates with 
XP) and for the production of hydroxylamine. Insertions of 
TnphoA into pVLN12 were mapped by restriction enzyme 
analysis of the corresponding plasmids. When necessary, the 
precise boundaries of the iucD'-'phoA sequence junction 
were determined by sequencing, in an M13 system (see 
above), restriction fragments overlapping the junctions. 



TABLE 1. Characteristics of relevant £. coli strains, 



phages and plasmids 


Strain, phage, 
or plasmid 


Genotype or description — 


Source or 
' reference 


E, coli 






71-18 


&(lac proAB) thi supE F' proAB 
lacFZAAflS 


R. Kolter 


CC118 


F" A {ara-leu)7697 araD139 
MacX74 galE galK phoA20 thi 
rpsE rpoB argE(Am)recAl 


C. Manoil 


LE392 


F" supF supE hsdR galK trpR metB 
lacYtonA 


C. Manoil 


X TnphoA 


bill cI857 Pdixnh with TnphoA in or 
near rex 


C. Manoil 


Plasmids 






pMLB1034 


Ap r lacZ gene fusion vector 


48 


pVLN12 


Ap r iucD+ 


16 


pOV7 


Ap r iucD'-'lacZ 


This study 


pOV9 


Ap r iucD'-'lacZ 


This study 


pOV5 


Ap r Km r iut A' : .TnphoA 


This study 


pOV25 


Ap r Km r iut A' ..TnphoA 


This study 


pOV33 


Ap r Km r /wc£>'::Tnp/ioA 


This study 


pOV99 


Ap r Km r iucD' "TnphoA 


This study 


pOV48 


Ap r Km r iucD'y.TnphoA 


This study 


pOV28 


Ap r Km r iucD' \\TnphoA 


This study 
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Two p-galactosidase gene fusions were obtained in vitro 
by Iigating the 3.3-kb Smal-Ball fragment of pMLB1034 
vector (48), containing a lacZ gene devoid of promoter and 
of ribosome-binding sequence, into plasmid pVLN12 di- 
gested with either Kpnl orNcol and treated for a short time 
with Bal 31 enzyme. This last treatment was made to ensure 
the presence of all possible reading frames. The ligation 
mixtures were used to transform E. coli CC118! Transfor- 
mant blue colonies arising in LB with 150 ^g of ampicillin 
and 40 jig of 5-bromo^TchIoro-3-indolyl-p-p-galactoside 
(Sigma) solid medium per ml were picked for further analysis 
(see Results section). 

RESULTS 

Sequence of the iucD gene. pVLN12 (16) carries the active 
iucD gene in a 3.1-kb insert from pColV-K30 in pUC9. As 
the sequence of the following gene {iutA) is known (34) as 
well as the transcriptional orientation (6), we focused our 
attention on a 2.3-kbAccI restriction fragment of pVLN12 in 
which the iucD gene sequence was expected to map. Figure 
1 shows the collection of inserts in M13 clones which were 
used for sequencing the entire region. Each nucleotide was 
sequenced an average of three times. 

Only one open reading frame of sufficient length to encode 
the IucD protein, previously sized in minicell systems in the 
range of 50 kilodaltons (kDa) (11, 16, 24, 32, 33), could be 
found with the transcriptional orientation of the aerobactin 
operon. The sequence of that open reading frame is shown in 
Fig. 2. This structural sequence (1,278 base pairs [bp]) 
contained an ATG start codon located 9 bp downstream 
from an AGGA sequence which might form part of a 
ribosome-binding site (47). The putative structural region 
spanned 426 amino acid residues and ended with the stop 
codon TAG. Of the residues, 48% were hydrophobic, 26% 
were polar uncharged, 15% were positively charged, and 
11% were negatively charged. The predicted molecular 
weight of this peptide was 48,968. Figure 3 shows the 
hydrophobicity of the protein throughout the entire se- 
quence according to Kyte and Doolittle (35). 

The sequences upstream and downstream of the iucD 
structural gene were also examined. Most of the approxi- 
mately 850 bp between the BamHl site and the start of the 
iucD gene overlapped the structural sequence of the preced- 
ing gene in the operon, iucC, although the precise bound- 
aries are unknown. No evident promotorlike sequences, i.e., 
-35 and -10 consensus sequences, were detectedjeither up- 
or downstream of iucD. Interestingly, the intercistronic 
regions at both sides of iucD contained short regions of dyad 
symmetry (Fig. 2). The dyad preceding the iucD structural 
gene would allow the formation of a stem-loop structure in 
the corresponding transcript with a calculated AG value of 
-20.6 kcal/mol, while the dyad succeeding the gene would 
have a AG of -17.6 kcal/mol. 

Isolation and identification of the iucD protein product. £. 
coli cells harboring pVLN12 hyperproduced an approxi- 
mately 50-kDa peptide when analyzed in SDS-PAGE sys- 
tems (not shown). This polypeptide had a molecular weight 
similar to that of a protein identified in minicells as the 
product of the iucD gene (16, 24). To ascertain whether this 
peptide was indeed the product of the gene and to check the 
reading frame obtained from nucleotide sequence data, we 
purified this protein band and subjected it to amino-terminal 
analysis. E. coli CC118 harboring the plasmid pOV25 was 
used as the source of the protein (see below). For unknown 
reasons, this plasmid construction afforded further hyperex- 



pression of the approximately 50-kDa protein to a level close 
to 5% of the total cell protein (see Fig. 5). The protein band 
was purified from a whole-cell extract by electroelution from 
a polyacrylamide gel. The initial 15 amino acids of the 
isolated polypeptide were identified as NH 2 -Met-Lys-Lys- 
Ser-Val-Asp-Phe-Ile-Gly-Val-Gly-Thr-Gly-Pro-Phe. These 
are. the same residues expected from the translated se- 
quence, including the initial methionine as the amino termi- 
nus. This result allowed the unequivocal assignment of the 
approximately 50-kDa protein as the product of the iucD 
gene. 

phoA and lacZ gene fusions to iucD. Alkaline phosphatase 
in E. coli must be exported into the periplasm to be active 
(28, 40). Therefore, phoA gene fusions are a convenient 
method to probe the topology of putative membrane-bound 
and exported proteins (7, 38, 39). To construct a number of 
iucD'-'phoA gene fusions, we mutagenized an E. coli strain 
harboring the plasmid pVLN12 with TnphoA and selected 
insertions of the transposon in the multicopy plasmid with a 
high concentration of kariamycin. Should the product of 
iucD be a membrane-associated protein and should it have 
domains facing the periplasm, colonies with a PhoA + phe- 
notype concomitant with the loss of hydroxylamine produc- 
tion (iucD) should be isolated after mutagenesis. A num- 
ber of blue colonies in XP medium indeed appeared after 
TnphoA mutagenesis, and they were picked for further 
analysis. Most of the deep-blue colonies resulted from 
insertions in tlie sequence of iutA present in pVLN12. This 
was expected since the truncated sequence of iutA remaining 
in pVLN12 (Fig. 4) keeps the intact signal peptide sequence 
necessary for protein export (34). The appearance of these 
conspicuous PhoA + colonies due to insertions in iutA con- 
stituted an internal control for the performance of the 
procedure. 

Figure 4 summarizes the location and orientation of the 
TnphoA insertions found. Four insertion derivative plas- 
mids, pOV33, pOV99, pOV48, and pOV28, conferred a 
PhoA + IucD phenotype and were studied in more detail. 
As a complement to the phoA fusions, we also obtained two 
iucD'-'lacZ gene fusions in vitro as described in the Mate- 
rials and Methods section, which were termed pOV7 and 
pOV9. The precise positions of fusion junctions were con- 
firmed by nucleotide sequencing, and they are indicated in 
Fig. 3 and Table 2. 

Analysis of lucD'-'PhoA and Iuc'-'LacZ fusion proteins. 
The levels of alkaline phosphatase and p-galactosidase af- 
forded by E. coli cells carrying different phoA and lacZ 
fusions are shown in Table 2. All four iucD'-'phoA fusions 
(carried in pOV33, pOV99, pOV48, and pOV28) afforded 
phosphatase activity, although the levels varied significantly 
in each fusion. Cells harboring pOV33 and pOV99 showed 

15- to 20-fold-higher levels of phosphatase than those with 
pOV48 and pOV28, while an iutA'-'phoA fusion used as a 
positive control of effective export (pOV5) had levels five- 
fold above those of the most active iucD'-'phoA fusion 
(pOV33). 

Plasmids with iucD'-'lacZ fusions (pOV7 and pOV9) af- 
forded rather high values for p-galactosidase activity (Table 
2). Permeabilization of cells prior to the assay resulted in a 

16- fold increase in P-galactosidase activity detected in cells 
carrying pOV7, while the same treatment produced an 
increase of only 2.5-fold in those with pOV9. 

Figure 5 shows the protein profile ofE. coli cells harboring 
the plasmids with the iucD'-'phoA and iucD'-'lacZ fusions 
analyzed in an SDS-PAGE system. In all cases, hyperpro- 
duction of a protein of the approximate size of the predicted 
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CC(WTAAAACTGACCTC(X!CTCATCTGGAT<KfCGGCAGCCCCATCW 



20 



30 



j _1Q . 

*«t Lva Lva Ser Val Asd Pha lie Cly Val Gly Tbr Gly Pro Phe Asn Leu Ser He Ala Ala Uu Ser His Gin He Glu Glu Leu Asp 

m m ul m m gat m m got g?a ggg aca gcc cca ttt aat ctc ago att got ccg ttg tca cat cag atc gaa caa ctg gac 

40 50 60 

Cys Leu Phe Phe Asp Glu His Pro His Phe Ser Trp His Pro Gly Het Leu Val Pro Asp Cys His Ket Gin Thr Val Phe Leu Lys Asp 
TGT CTC TTC TTT GAT CAA CAT CCT CAT TTT TCC TGC CAT CCG GGT ATG CTG GTA CCG CAT TGT CAT ATG CAG ACC GTC TTP CTG AAA GAT 

70 00 90 

Leu Val Ser Ala Val Ala Pro Thr Asn Pro Tyr Ser Phe Val Asn Tyr Leu Val Lys His I*s Lys Phe Tjr Arg Phe Leu Thr Ser Arg 
CTG CTC ACT GCT GTT GCA CCT ACA AAT CCC TAC ACT TTT GTT AAC TAT CTG GTG AAG CAC AAA AAG TTC TAT CGC TTC CTT ACA AGC AGA 

100 110 120 

Leu Ar« Thr Val Ser Arg Glu Glu Phe Ser Asp Tyr Leu Arg Trp Ala Ala Glu Asp Met Asn Asn Leu Tyr Phe Ser His Thr Val Glu 
CTA CGT ACA GTA TCC CGT GAA GAG TTT TCT GAC TAC CTC CGC TGG GCT GCT GAA CAT ATG AAT AAC CTG TAT TTC ACT CAT ACC GTT GAA 

150 140 150 

Asn He Asp Phe Asp Lys Lys Arg Arg Leu Phe Leu Val Gin Thr Ser Gin Gly Gin Tyr Phe Ala Arg Asn He Cys Leu Gly Thr Gly 
AAC ATT GAT TTC GAT AAA AAA CGT CGA TTG TTT CTG GTG CAA ACC AGC CAG CCA CAA TAT TTT GCC CGC AAT ATC TGC CTT GCT ACA GGA 

160 170 180 

Lys Gin Pro Tyr Leu Pro Pro Cys Val Lys His Het Thr Gin Ser Cya Phe His Ala Ser Clu Ser Asn Leu Arg Arg Pro Asp Leu Ser 
AAA CAA CCT TAT TTA CCA CCC TGT GTG AAG CAT ATG ACA CAA TCC TGT TTC CAT GCC ACT GAA ACT AAT CTT CGT CGC CCG GAT CTT ACT 

190 200 210 

GIt Lts Ant He Thr Val Val Gly Cly Gly Gin Ser Gly Ala Asp Leu Phe Leu Asn Ala Leu Arg Gly Clu Trp Gly Glu Ala Ala Glu 
GGA AAA CGG ATA ACC CTG CTT GGT GGA GGA CAG ACT CGT GCA CAC CTG TTC CTT AAT GCA TTA CGC GGG GAA TGG GCA GAA GCG GCC GAA 

220 250 240 

He Aan Trp Val Ser Arg Arg Asn Asn Phe Asn Ala Leu Asp Glu Ala Ala Phe Ala Asp Asp Tyr Phe Thr Pro Clu Tyr lie Ser Cly 
ATA AAC TGG GTG TCC CGC CCT AAT AAT TTT AAC GCA CTG CAT CAG GCT CCT TTT GCT GAT GAT TAT TTT ACA CCT GAA TAT ATT TCA GCC 

250 260 270 

Phe Ser Gly Leu Clu Glu Asp He Arg His Gin Leu Leu Asp Glu Gin Lys Thr Asp He Cly Trp His His Cys Pro lie Leu Leu Leu 
TTC TCC GGA CTG CAG GAA GAT ATT CGC CAT CAG TTA CTG CAT GAG CAG AAA ACT GAC ATC GGA TGG CAT CAC TGC CCG ATT CTT TTA CTG 

280 . 290 500 

Thr He Tyr Arg Glu Leu Tyr Hia Arg Phe Glu Val Leu Arg Lys Pro Arg Asn lie Arg Leu Leu Pro Ser Arg Ser Val Thr Thr Leu 
ACC ATT TAT CGT GAG TTG TAC CAC CGT TTT GAA GTT CTG AGA AAA CCA ACA AAT ATC CCT CTG CTA CCC AGC CGC TCG GTA ACA ACT CTG 

510 320 330 

Glu Ser Ser Gly Pro Gly Trp Lye Leu Leu Met Glu Hia His Leu Asp Cln Cly Arg Glu Ser Leu Clu Ser Asp Val Val lie Phe Ala 
GAA ACT ACT GGT CCT CGC TGG AAG TTG CTG ATG GAG CAT CAT CTG GAT CAG GCC AGG GAG AGC CTG GAA ACT GAT GTG CTG ATT TTC GCC 

540 350 360 

Thr Gly Tyr Arg Ser Ala Leu Pro Gin He Leu Pro Ser Leu Met Pro Leu He Thr Met His Asp Lys Asn Thr Phe Lys Val Arg Asp 
ACA GCT TAC CGT TCT GCG TTG CCA CAA ATA CTT CCC TCA CTG ATG CCC CTG ATC ACC ATG CAC GAT AAG AAC ACC TTT AAA GTG CGT CAT 

370 380 390 

Asp Phe Thr Leu Glu Trp Ser Cly Pro Lys Clu Asn Asn He Phe Val Val Aan Ala Ser Met Gin Thr His Gly He Ala Clu Pro Gin 
CAC TTC ACT CTG CAA TGC ACT GGC CCG AAA GAG AAC AAC ATC TTC GTG CTC AAC GCC ACT ATG CAA ACC CAT GGC ATC GCC CAA CCC CAG 

400 410 — , 420 

Leu Ser Leu Met Ala Trp Arg Ser Ala Arg He Leu Asn Arg Val Met Cly Arg Asp Leu Phe Asp Leu Ser Met Pro Pro Ala Leu He 
CTC AGC CTG ATG GCA TGG ACA TCT CCA CGT ATT CTT AAT CGC GTA ATG GCA CGT GAT TTA TTC CAT CTC AG^ATG CCG CCC GCC CTG ATT 



426 

Gin Trp Arg Ser Gly Thr AM 
CAG TCG CGC ACC GGC ACC TAG 



SD 

GCAAAACCCAGCCCGAGGCTCCTTCTTTAACTCCCTACACAGCATCTT^ 



FIG 2 Nucleotide sequence of iucD. Nucleotide sequence of the DNA region of P VLN12 containing the iucD gene of the aerobactin 
oneron. The amino acid sequence of the predicted translation product is shown above the nucleotide sequence The numbers above each line 
refer to the number of amino acid residues. The segment marked with a line corresponds to the amino-terminal residues of the punned IucD 
product determined by protein sequencing (see text). Two potential Shine-Dalgarno (S.D.) ribosome-bindmg sites (47) are indicated. The one 
assigned to iucD is located 9 bp upstream of the initiation codon ATG; the signal at the end of the sequence has been previously ascribed to 
iutA y the next gene in the aerobactin operon (34). Two short regions of dyad simmetry located in the intercistronic sequences surrounding the 
iucD gene are shown by paired arrows. The unique Sph\ restriction site in pVLN12 is also indicated. 



fusion product (Table 2) was observed. Incidentally, we 
observed that cells containing a fusion plasmid named 
pOV25 excreted increased amounts of hydroxylamine into 
the culture (as detected with the Csaky test modified by 
Gillam et al. [21]) and produced a stronger 50-kDa band 
when analyzed in an SDS-PAGE system. pOV25 consisted 
-Of a pVLN12 derivative with a TnphoA insertion in iutA 



oriented opposite to iucD transcription (Fig. 4). E. coli cells 
carrying this construction were subsequently used as the 
source of the approximately 50-kDa protein. 

To check whether the IucD'-'PhoA and IucD'-'LacZ 
fusion proteins could be entirely located in the periplasm, 
the corresponding E. coli cells were subjected to chloroform 
shock to release periplasmic proteins (18). No fusion protein 
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nr , (0 , 3 : Hydropathicity profile of IucD protein. The hydropathic index was determined at a span setting of 7 residues with the algorithm 
preset, ed by Kyte and Doolittle (35). Hydrophobic regions extend above the middle line. The arrows show the location of the in^TleS 
to me tour mcD - phoA and the two iucD'-'lacZ gene fusions studied in this work. 



was observed in the shock fluid (not shown), ruling out a free 
periplasmic location. 

To ascertain whether domains of the fusion proteins could 
be accessible from the outside of the cytoplasmic membrane, 
spheroplasts from every fusion-containing E. coli strain 
(Table 2) were prepared, treated with a low concentration of 
trypsin, and subsequently analyzed in SDS-PAGE systems. 
The protein fusion bands from pOV33, pOV99, pOV48, 
pOV28, and pOV9 faded from spheroplast proteins after 
trypsin treatment, while that of pOV7 remained insensitive 
to protease (Fig. 6). Interestingly, in spheroplasts from E. 
colt carrying the intact iucD gene, the approximately 50-kDa 
product was not significantly affected by trypsin (Fig. 6). 

DISCUSSION 

Results from different groups (16, 24) support the current 
hypothesis that the biosynthetic route of the siderophore 
aerobactin in pColV-encloded systems is identical to that 
proposed by Appanna et al. (1) for Aerobacter aerogenes 
62-1, the microorganism for which the production of aero- 
bactin was first described: In this scheme, the very first step 
leading to aerobactin production is the A^-hydroxylation of 
L-lysine, followed by an A^-acylation of the corresponding 
A^-hydroxylysine and an eventual condensation of two 
A^-hydroxy-A^-acetyl lysine moieties with a citric acid 
- backbone (1, 16, 24). 

The presence of an approximately 50-kDa membrane- 
bound protein as the product of one of the genes of the 
aerobactin system of pCoIV-K30 was first shown by Krone 
et al. (32) and assigned a role in the transport of aerobactin. 
However, overwhelming evidence from different laborato- 
ries later demonstrated that the peptide was instead involved 
in aerobactin biosynthesis (11, 44), in particular in the step of 
A^-hydroxylation of L-lysine (16, 24) which precedes the 
eventual synthesis of the aerobactin molecule.This assign- 
ment, based on complementation analysis (23) and studies 
on accumulation of aerobactin precursors by different mu- 
tants (16, 24), is consistent with recent in vitro studies made 
with the unpurified protein (27). In addition, Coy et al. (13) 
showed that A^-hydroxylysine is the substrate for the puri- 
ne ac etylase, the enzyme which mediates the next step in 
the biosynthetic pathway. Some remaining uncertainties (20) 
in the biosynthetic pathway of aerobactin synthesis have 
been addressed elsewhere (3). The gene of the aerobactin 



operon encoding this protein was termed iucD (16) or, 
alternatively, aerA (23). 

The complete sequence of the gene is displayed in Fig. 2 
and to our knowledge is the first reported for a biosynthetic 
gene of a siderophore. The sequence accounts for the 
production of a predicted 48,968-dalton peptide, a value very 
close to the approximately 50 kDa observed in mini- and 
maxicell systems by several laboratories (11, 16, 24, 32, 33). 

IucD protein is a moderately hydrophobic polypeptide. 
Analysis of the 15 amino- terminal residues of the gene 
product reveals that mature IucD protein is the unprocessed 
translational product of the iucD gene sequence, including 
the initial ATG codon. The amino-terminal sequence of IucD 
also resembles that of known signal peptides in that it starts 
with basic amino acids (lysines at positions 2 and 3) followed 
by a region rich in hydrophobic residues (Fig. 2). Uncleaved 
leaderlike sequences are frequently found in inner mem- 
brane proteins of E. coli, as for instance in the Tet protein 
from Tn/0 or pBR322 (43), the Tsr protein (serine chemore- 
ceptor [8]), the Tar protein (aspartate chemoreceptor [451), 
and others. 

Analysis of codon usage in iucD suggests that it is a rather 
weakly expressed gene since nonoptimal codons are fre- 
quent throughout the sequence, thus possibly reducing the 
rate of translation. A remarkable feature is the appearance at 
positions corresponding to amino acids 11, 12, and.13 of a 
group of three codons (GGG, ACA, and GGG) which are 
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FIG. 4. Positions of TnphoA insertions in plasmid pVLN12. The 
open arrows indicate the position and orientation of different Tn- 
phoA insertions throughout the 3.1-kb insert of pVLN12, the 
derivative plasmids being identified with the same pOV serial 
number (see Tables 1 and 2). Solid arrowheads represent insertions 
that generate in-frame translational sequences. The striped segment 
of the figure symbolizes the vector portion of constructions. 
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l TABLE 2. Characteristics of 'p ho A and 'lacZ gene fusions in E. coli CC118 

Protein size (kDa) f tivit V* 

- . AP activity* No. of IucD Hydroxylamine tu/uu «» ) 

Plasm»d Gene fusion res idues in hybrid* — ~ "— production^ Pcn neabilized i^t 

Predicted* Found' „ „ 

cells cells 



r>OV25 

pOV33 

pOV99 

pOV48 

pOV28 

pOV5 

pOV7 

pOV9 



None 

iucD'-'phoA 
iucD'-'phoA 
iucD'-'phoA 
iucD'-'phoA 
iutA'-'phoA 
iucD'-'lacZ 
iucD'-'lacZ 



<1 


426 


49 


47.5 




375 


14 


48.5 


50 




110 


2i 


49.5 


51 




12 


160 


65 


61 




10 


357 


88 


87 




1,700 


N.A. . 


N.D. 


57 


+ 




42 


120 


120 






380 


159 


152 





15,000 
1,500 



1,100 
580 



a Construction and structure of plasmids pOV25, pOV33, pOV99, pOV48, pOV28, pOV5. pOV7, and pOV9 are described in the text and in Table 1. 
Plasmids were harbored in all cases by E. coli CC118. 

* Alkaline phosphatase (AP) activity was assayed by measuring the rate of /?-nitrophenylphosphate hydrolysis in permeabilized cells (40). E. coli CC118 
cells carrying the different plasmids were grown in LB liquid culture, washed, and suspended in medium 121 (31) prior to the assay. Values are the averages 
of three independent experiments. 

c Number of amino acid residues of the IucD product present in the hybrid protein. Every DNA junction fragment was cloned in phage M13 derivatives; 
its nucleotide sequence was obtained by the dideoxynucleotide chain termination method (46), and from this the amino acid sequence was deduced. N.A., 
Not applicable (pOV5 contains an /u/A'-'phoA gene fusion and a complete iucD gene). 

d Calculated from the position of the TnphoA insertion, assuming 47 kDa as the molecular weight of alkaline phosphatase in TnphoA (38), and an average 
molecular weight of 115 for the amino acid residue in the IucD product. In the case of iucD'-'lacZ fusions, 116 kDa was the mass considered the 
p-galactosidase contribution (41). N.D., Not determined. 

* Determined for hybrid proteins separated by SDS-PAGE by their rates of migration relative to proteins of known molecular weight. 
/ Detection of hydroxylamine-related compound(s) in culture supernatants by the Csaky test as modified by Gillam et al. (21). 

* E. coli CC118(pOV7) and E. coli CC118(pOV9) cells were washed in Z buffer (41), and their p-galactosidase activity was determined either on cells 
: - rmeabilized with chloroform and SDS (total activity) or on intact cells to estimate the activity independent of Lac Y permeation (49). 



very unusual in the normal codon usage of E. coli (22). 
Clustering of rare codons in signal sequences seems to 
further decrease the rate of translation (51), which is be- 
lieved to facilitate early interactions of proteins with the 
inner membrane (10). 

Some features of the mode of association of this protein 
with the inner membrane were suggested by the study of a 
number of iucD'-'phoA and iucD'-'lacZ gene fusions. The 
rationale for this approach has been discussed in detail 
elsewhere (38-40). In summary, alkaline phosphatase is 
inactive when localized in the cytoplasm (28, 40), and 
therefore, the activity of a 'phoA gene fusion will depend on 
the translocation of the hybrid protein to the periplasm, or at 
least the 'PhoA moiety of the fusion must face the periplasm 
(38, 39). A similar but less stringent approach can be applied 
to 'lacZ gene fusions carried by lacY mutants (defective in 
lactose permease). In these cells, a hybrid protein localized 
in the cytoplasmic membrane will afford a remarkable in- 
crease in (3-galactosidase activity (determined by hydrolysis 
of o-nitrophenyl-p-D-galactoside) after permeabilization of 
the cells if the 'LacZ moiety of the protein is cytoplasmic, 
while a smaller difference will be observed if it faces the 
periplasm (50). { 

Four iucD'-'phoA fusions conferring a PhoA + phenotype 
were isolated (Table 2). All of them were stable and consti- 
tuted a major protein component of the cells carrying the 
corresponding plasmids (Fig. 6). This alone rules out the free 
cytoplasmic location of the fusions, since alkaline phospha- 
tase is generally quite unstable when exposed to the cyto- 
plasm (38). The hybrid proteins encoded by pOV33 (at the 
14th amino acid residue of IucD) and pOV99 (at residue 21) 
(Fig. 3 and Table 2) afforded a significant level of alkaline 
phosphatase activity. These proteins did not appear in the 
periplasmic fluid after chloroform shock, suggesting that the 
'PhoA moiety of the fusion is anchored in the inner mem- 
brane and facing the periplasm. Since no information re- 
quired to act as an export signal or to block export is found 
in either the alkaline phosphatase or the 17 residues contrib- 
uted by the IS50 L of TnphoA (38)r we conclude that the 



translocation to the periplasm of 'PhoA moieties and the 
final anchoring at the membrane of the two smaller IucD'- 
'PhoA proteins is promoted by the sequence of the first 14 
residues from IucD present in both fusions. It is likely that 
part of these 14 residues forms a thermodynamically stable 
association with the membrane, probably following a model 
of membrane-spanning domain closely related to those dis- 
cussed by Davis et al. (15). These authors have shown that 
not only hydrophobic a-helices but also strings containing 
only 11 hydrophobic amino acids can generate membrane- 
spanning domains. Recent studies have also demonstrated 
that an anchoring domain can be generated by a short stretch 
of hydrophobic and neutral residues (19), and even the five 
to eight consecutive hydrophobic amino acids near the N 




FIG. 5. Analysis of hybrid proteins synthesized in £. coli CO 18 
cells carrying different fusion plasmids. The proteins present in 
crude extracts of cells were separated by SDS-PAGE (8% acryl- 
amide) and stained with Coomassie blue. Lanes: M, molecular 
weight markers (in thousands); 1, control E. coli CC118, no plasmid; 
2, control E. coli CCll8(pOV25) (pOV25 carries a TnphoA insertion 
in iutA which leaves intact iucD; see Fig. 4); 3, E. coli 
CCll8(pOV33); 4, E. coli CCll8(pOV99); 5, E. coli CC118(pOV48); 
6, £. coli CC118(pOV28); 7, E. coli CC118(pOV7); 8, E. coli 
CCll8(pOV9). The protein products IucD, the four lucD'-'PhoA 
fusions, and the two IucD'-' LacZ fusions can be identified as the 
major bands in lanes 2, 3 to 6, and 7 to 8, respectively. 
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FIG. 6. Accessibility of IucD'-'PhoA and IucD'-'LacZ protein 
fusions to trypsin in spheroplasts. A Coomassie-blue stained SDS- 
PAGE system (8% acrylamide) loaded with spheroplasts from cells 
harboring gene fusion plasmids is shown. Numbers on top refer to 
the number of the O V fusion plasmid carried in each case by E. coli 
CC118 cells. Treatment of the spheroplasts with trypsin is indicated 
above each lane. Cells with the construction pOV25 were intro- 
duced as a control: pOV25 carries a TnphoA insertion in iutA which 
leaves intact iucD (see Fig. 5). Lanes M contained molecular weight 
standards (in thousands). 

terminus of a 'LacZ protein fusion can serve to anchor a 
functional (3-gaIactosidase to the inner membrane of E. coli 
(12). 

Since the leading region of 14 and 21 amino acids in the 
fusions OV33 and OV99, respectively, appears to be enough 
to promote the export of their PhoA moiety, it is likely that 
the leading peptide is also able to promote interactions of the 
IucD product itself with the membrane. The leading region 
of at least 14 amino acids might define a first domain of 
attachment of this protein to the cytoplasmic membrane. 

The levels of alkaline phosphatase afforded by the PhoA + 
fusions encoded by pOV48 (TnphoA insertion at residue 160 
of IucD) and pOV28 (at residue 357) were much lower than 
the levels detected in the presence of pOV33 and pOV99 
(Table 2), although the corresponding proteins were equally 
stable and synthesized in similar, quantities (Fig. 5). 

Two iucD'-'lacZ gene fusions were prepared in vitro by 
hgating the 'lacZ gene to residues 42 and 380 of IucD 
sequence in plasmid pVLN12, generating, respectively, 
piasmids pOV7 and pOV9. Both of them yielded rather high 
P-galactosidase activities in E. coli CC118 cells, which lack 
lactose permease. It is remarkable that permeabilized cells 
with fusion plasmid pOV9 increased only 2.5 times their 
p-galactosidase activity levels compared with intact cells, 
while the corresponding increase detected in cells with 
pOV7 was over 12-fold. In lacY cells the activity of cyto- 
plasmic free P-galactosidase increased usually over 10 times 
after permeabilization (not shown). p-Galactosidase values 
suggested that the 'LacZ moiety in the pOV9 protein fusion 
is facing the periplasm, while that of pOV7 is facing the 
cytoplasm. Both hybrid proteins were stable and synthesized 
in similar amounts, but the total p-galactosidase activity 
levels were some 10-fold higher with pOV7 than with pOV9. 

A final set of assays was carried out to check whether 
domains of the IucD'-'PhoA and IucD'-'LacZ fusion pro- 
teins could be facing the periplasm. For this purpose, 
spheroplasts from cells carrying the fusions were prepared 
and treated with trypsin, and their protein profiles were 
compared with those of untreated spheroplasts (Fig. 6). 
Although spheroplasts from cells carrying pOV25 with the 
functional iucD gene did not reveal a significant degradation 
of the approximately 50-kDa IucD protein, all four IucD'- 



'PhoA fusions appeared to be sensitive to trypsin, since 
analysis in an SDS-PAGE system showed that the corre- 
sponding bands disappeared at high trypsin concentrations 
(not shown) or at least faded after protease treatment (low 
trypsin concentration, Fig. 6). With respect to the IucD'- 
'LacZ fusions, Fig. 6 also shows that the fusion from pOV9 
was heavily degraded, while that of pOV7 remained virtually 
intact. These last two fusions (OV7 and OV9) provided 
reasonable controls for the performance of the procedure, 
because cytoplasmic PhoA fusions are unstable (38) and 
would therefore be unreliable as controls in this type of 
experiment. 

It is known that fusions of some exported proteins to 
'LacZ are lethal to E. coli cells when produced in large 
amounts, a phenomenon explained as jamming of the export 
machinery and concomitant accumulation of precursor 
forms of other exported proteins" (5, 29, 40, 50). Although the 
signal sequences of the corresponding proteins (i.e., alkaline 
phosphatase, maltose-binding protein, etc.) are capable of 
directing p-galactosidase to the membrane, the export is not 
completed and the hybrid proteins become membrane 
bound. Since the 'LacZ moiety was never found translo- 
cated into the periplasm, it was considered that p-galacto- 
sidase is not competent to cross through the lipid bilayer 
(40). In our case, the small difference found in p-galacto- 
sidase activity between permeabilized and intact cells con- 
taining plasmid pOV9, together with the apparent accessi- 
bility of the same pOV9-encoded hybrid protein to trypsin 
digestion in spheroplasts, indicate that the 'LacZ moiety has 
been mobilized into the periplasm, and therefore that p- 
galactosidase can indeed be translocated. This also shows 
that membrane-bound proteins fused to 'LacZ can be syn- 
thesized in large amounts, apparently without deleterious 
consequences. Probably the IucD product is localized in or 
at the inner membrane independently of the export machin- 
ery, and this fact circumvents the problem arising from 
blockage of the export pathway detected in the study of 
some other hybrid proteins (5, 29, 40). 

The previous results support the association of the iucD 
gene product with the inner membrane of E. coli cells, with 
the bulk of the protein facing the inner side and with at least 
two attachment sites which extend into the lipid bilayer. One 
of them would be at the leading sequence of the protein and 
is defined by the 'phoA fusions of pOV33 and pOV99, while 
the other occurs towards its terminal portion and is defined 
by the lacZ fusion of pOV9. This last domain may also 
account for the 'phoA fusion of pOV28. Although the 
" peptide sequences which probably protrude into the mem- 
brane afford translocation of the 'PhoA and 'LacZ moieties 
of those fusions into the periplasm, there is no evidence that 
the IucD product itself has any domain exposed to the 
periplasm. An additional region of attachment to the mem- 
brane could occur in the region close to residue 160, ac- 
counting for the PhoA fusion of pOV48, since cells with that 
plasmid show detectable alkaline phosphatase activity and 
the corresponding fusion protein is sensitive to trypsin in 
spherorMasts. The low activity of the IucD'-'PhoA proteins 
encoded by pOV48 and pOV28 may result from an unusual 
type of association with the membrane that could affect 
some of the steric requirements for activation of alkaline 
phosphatase (39). 

The location of the IucD product makes biological sense. 
As this enzyme is involved in iron metabolism and probably 
requires molecular oxygen for its activity (3), its membrane 
location would contribute to maintenance of a reducing 
intracellular milieu. 
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Important features of the gene product of iucD remain 
uncertain, mostly those regarding its enzymology.We think 
that the information on the gene sequence and the charac- 
teristics of the protein described here will contribute to the 
elucidation of a rational scheme for its purification and, 
ultimately, to the discovery of inhibitors of its activity. 
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